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PRIMARY -EXAMINER : ShawWJareth D. 

ASSISTANT- EXAMINER: Ku^t; Paul 
ATTY- AGENT- FIRM: Falk; James W. 

ABSTRACT : 

A method and apparatus for performing storage and retrieval in an information storage 
system is disclosed which uses the hashing technique. In order to prevent contamination 
of the storage medium by deleted records, a hybrid hashing technique is used which uses a 
fast, contaminating deletion of records during times of heavy load on the system, but 
uses a slow, non-contaminating deletion when the load on the system is not as heavy. The 
slow, non-contaminating deletion automatically removes previously generated contamination 
in the vicinity of the slow, non-contaminating deletion, thereby automatically 
decontaminating the storage space. Because no long term contamination can build up in the 
present system, it is useful for large data bases which are heavily used and which 
require the fast access provided by hashing. 
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ASSISTANT- EXAMINER: Kulik; Paul 
ATTY- AGENT- FIRM: Falk; James W. 

ABSTRACT : 

A method and apparatus for performing storage and retrieval in an information storage 
system is disclosed which uses the hashing technique. In order to prevent contamination 
of the storage medium by automatically expiring records, a garbage collection technique 
is used which removes all expired records in the neighborhood of a probe into the data 
storge system. More particularly, each probe for insertion, retrieval or deletion of a 
record is an occasion to search the entire chain of records found for expired records and 
then removing them and closing the chain. This garbage collection automatically removes 
expired record contamination in the vicinity of the probe, thereby automatically 
decontaminating the storage space. Because no long term contamination can build up in the 
present system, it is useful for large data bases which are heavily used and which 
require the fast access provided by hashing. 
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pp. 733-738, Aug. 1995. 
ART-UNIT: 237 

PRIMARY- EXAMINER: Black; Thomas G. 
ASSISTANT -EXAMINER : Corrielus; Jean M. 
ATTY- AGENT-FIRM: Cesari & McKenna, LLP 

ABSTRACT : 

A method for rapidly hashing records in a large database stored on a secondary storage 
device in which a set of memory-blocks are preferably established in main memory for 
receiving information. Each memory block is associated with a sub-range of hash values 
that collectively span a range of hash values derived from one or more fields of the 
database records. The hash values together with other information are distributed among 
the memory-blocks in accordance with the range of hash values. As each memory block 
fills, its contents are written to an intermediate file associated with the memory-block 
in secondary storage. The intermediate files are subsequently retrieved and the hash 
values stored therein are ordered. The ordered intermediate files are then written to 
secondary storage as a single hash table spanning the entire database. 
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Data supplied to a data storage system by a host processor has one of two use status. A 
first use status is that the supplied data is to be retentively stored in the data 
storage system. A second use status is that the supplied data is exempted from the 
retentive storage requirement. An example of exempted use status is that data only 
temporarily stored in the data storage system, i.e. is transitory. A second example is 
data that is being manipulated prior to retentive storage, data that is temporarily 
volatile. Termination of the exempted use status results in either discard or a retentive 
storage of the exempted use data. Data integrity controls for the exempted use status 
data are described. The invention is described for a data storage hierarchy environment 
having a volatile cache and a magnetic recorder as a backing store. The exempted use data 
need be stored only in or primarily in the cache while retentive data is primarily stored 
in the retentive store and selectively in the cache. 
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ATTY- AGENT-FIRM: Brodie; R. Bruce Klein; Ester E. 
ABSTRACT : 



A method for managing a cache hierarchy having a fixed total storage capacity is 
disclosed. The cache hierarchy is logically partitioned to form a least recently used 
(LRU) global cache and a plurality of LRU destaging local caches. The global cache stores 
objects of all types and maintains them in LRU order. In contrast, each local cache is 
bound to objects having a unique data type T(i), where i is indicative of a DataType. 
Read and write accesses by referencing processors or central processing units (CPU ! s) are 
made to the global cache. Data not available in the global cache is staged thereto either 
from one of the local caches or from external storage. When a cache full condition is 
reached, placement of the most recently used (MRU) data element to the top of the global 
cache results in an LRU data element of type T(i) being destaged from the global cache to 
a corresponding one of the local caches storing type T(i) data. Likewise, when a cache 
full condition is reached in any one or more of the local caches, the local caches in 
turn will destage their LRU data elements to external storage. The parameters defining 
the partitions are externally supplied. 
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ART-UNIT: 237 

PRIMARY-EXAMINER: Black; Thomas G . 
ASSISTANT -EXAMINER: Von Buhr; Maria N. 

ATTY- AGENT- FIRM: Allen; Kenneth R. Townsend and Townsend and Crew LLP 
ABSTRACT : 

In an object-oriented application being executed in a digital computing system comprising 
a processor, a method and apparatus are provided for managing information retrieved from 
a structured database, such as a relational database, wherein the processor is used to 
construct a plurality of object instances, each of these object instances having its own 
unique object ID that provides a mapping between the object instance and at least one row 
in the structured database. The processor is used to construct a single cohesive data 
structure, called an object cache, that comprises all the object instances and that 
represents information retrieved from the structured database in a form suitable for use 
by one or more object-oriented applications. A mechanism for managing the object cache is 
provided that has these three properties: First, through a technique called key 
swizzling, it uses explicit relationship pointers between object instances in the object 
cache to reduce the volume of queries to the structured database. Second, it ensures that 
only one copy of an object instance is in the cache at any given time, even if several 
different queries return the same information from the database. Third, the mechanism 
guarantees the integrity of data in the cache by locking data appropriately in the 
structured database during a database transaction, flushing cache data at the end of each 
transaction, and transparently re-reading the data and reacquiring the appropriate locks 
for an object instance whose data has been flushed. 
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A high-speed cache is shared by a plurality of independently-operating data systems in a 
multi- system data sharing complex. Each data system has access both to the high-speed 
cache and the lower- speed, secondary storage for obtaining and storing data. Management 
logic and the high-speed cache assures that a block of data obtained form the cache for 
entry into the secondary storage will be consistent with the version of the block of data 
in the shared cache with non-blocking serialization allowing access to a changed version 
in the cache while castout is being performed. Castout classes are provided to facilitate 
efficient movement from the shared cache to DASD. 
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created. The access function of the database access library is mapped to the database 
table. A data object is also created. The data object has a plurality of data attributes 
Each data attribute of the data object is mapped to the database table using the access 
function. Accessing of the data object is independent of the structure and syntax of the 
database table. 
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A CPU implementable method for minimizing thrashing among concurrent processes demand 
page accessing a data base through an LRU page organized buffer pool. There is 
ascertained the set of pages over which there is looping access behavior for the 
prospectively executing concurrent processes. This parameter, as determined for each 
task, is passed to the storage accessing component which partitions the buffer into LRU 
stacks and dynamically adjusts the stack to this predicted parameter size. 
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The present invention is a system, method, and computer program product for dynamically 
sizing a hash table when the average number of records per bucket in the hash table 
exceeds a maximum average number of records per bucket. In one embodiment, the hash table 
employs a modulo hashing function. In a second embodiment, the number of buckets is grown 
by a multiple of the previous number of buckets and records are re-hashed usinq a lazv 
re-hashing modulo algorithm that re-hashes records in a hash bucket only when those 
records are searched. In the second embodiment, when a hash table is re-sized, each new 
bucket is provided with a logical back pointer, or index, to a pre-existing bucket that 
potentially contains records that belong in the new bucket. When a search is directed at 
a new bucket, the logical back pointer, or index, directs the search to a pre-existinq 
bucket. When a search of a pre-existing bucket finds a data record that belongs in a new 
bucket, the record is moved to the new bucket. When there are no more records in a 
pre-existing bucket that belong in a new bucket, the logical back pointer from the new 
bucket to the pre-existing bucket is removed. Preferably, the logical back pointer is 
stored in the bucket where a regular pointer would normally be stored so that no extra 
space is needed. 
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ABSTRACT : 

A method and apparatus for performing storage and retrieval in an information storage 
system is disclosed that uses the hashing technique with the external chaining method for 
collision resolution. In order to prevent performance deterioration due to the presence 
of automatically expiring data items, a garbage collection technique is used that removes 
all expired records stored in the system in the external chain targeted by a probe into 
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the data storage systemWore particularly, each insertion»etrieval , or deletion of 1 
record is an occasion to search an entire linked- list chain of records for expired items 
and then remove them. Because an expired data item will not remain in the system lona 
term if the system is frequently probed, it is useful for large information storacre 
systems that are heavily used, require the fast access provided by hashing, and cannot be 
taken off-line for removal of expired data. c De 
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ABSTRACT : 



A method for rapidly hashing records in a large database stored on a secondary storage 
device in which a set of memory-blocks are preferably established in main memory for 
receiving information. Each memory block is associated with a sub-range of hash values 
that collectively span a range of hash values derived from one or morffields of tne 
database records. The hash values together with other information are distributed amono 
the memory-blocks in accordance with the range of hash values. As each memory block 9 
fills, its contents are written to an intermediate file associated with the memory-block 
in secondary storage. The intermediate files are subsequently retrieved and the Ssh 
values stored therein are ordered. The ordered intermediate files are then written to 
secondary storage as a single hash table spanning the entire database. wrltcen to 
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